Search CORE

117 research outputs found

Modeling Cancer Progression via Pathway Dependencies

Author: Chi Jen-Tsan
Edelman Elena J
Febbo Phillip G
Guinney Justin
Mukherjee Sayan
Publication venue: Public Library of Science
Publication date: 01/02/2008
Field of study

Cancer is a heterogeneous disease often requiring a complexity of alterations to drive a normal cell to a malignancy and ultimately to a metastatic state. Certain genetic perturbations have been implicated for initiation and progression. However, to a great extent, underlying mechanisms often remain elusive. These genetic perturbations are most likely reflected by the altered expression of sets of genes or pathways, rather than individual genes, thus creating a need for models of deregulation of pathways to help provide an understanding of the mechanisms of tumorigenesis. We introduce an integrative hierarchical analysis of tumor progression that discovers which a priori defined pathways are relevant either throughout or in particular steps of progression. Pathway interaction networks are inferred for these relevant pathways over the steps in progression. This is followed by the refinement of the relevant pathways to those genes most differentially expressed in particular disease stages. The final analysis infers a gene interaction network for these refined pathways. We apply this approach to model progression in prostate cancer and melanoma, resulting in a deeper understanding of the mechanisms of tumorigenesis. Our analysis supports previous findings for the deregulation of several pathways involved in cell cycle control and proliferation in both cancer types. A novel finding of our analysis is a connection between ErbB4 and primary prostate cancer

CiteSeerX

Directory of Open Access Journals

PubMed Central

Correlation set analysis: detecting active regulators in disease populations using prior causal knowledge

Author: Chindelevitch Leonid
DeLisi Charles
Guinney Justin
Huang Chia-Ling
Kostrowicki Jarek
Lamb John
Ziemek Daniel
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Abstract Background Identification of active causal regulators is a crucial problem in understanding mechanism of diseases or finding drug targets. Methods that infer causal regulators directly from primary data have been proposed and successfully validated in some cases. These methods necessarily require very large sample sizes or a mix of different data types. Recent studies have shown that prior biological knowledge can successfully boost a method's ability to find regulators. Results We present a simple data-driven method, Correlation Set Analysis (CSA), for comprehensively detecting active regulators in disease populations by integrating co-expression analysis and a specific type of literature-derived causal relationships. Instead of investigating the co-expression level between regulators and their regulatees, we focus on coherence of regulatees of a regulator. Using simulated datasets we show that our method performs very well at recovering even weak regulatory relationships with a low false discovery rate. Using three separate real biological datasets we were able to recover well known and as yet undescribed, active regulators for each disease population. The results are represented as a rank-ordered list of regulators, and reveals both single and higher-order regulatory relationships. Conclusions CSA is an intuitive data-driven way of selecting directed perturbation experiments that are relevant to a disease population of interest and represent a starting point for further investigation. Our findings demonstrate that combining co-expression analysis on regulatee sets with a literature-derived network can successfully identify causal regulators and help develop possible hypothesis to explain disease progression.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Spiral - Imperial College Digital Repository

Integrative analysis identifies candidate tumor microenvironment and intracellular signaling pathways that define tumor heterogeneity in NF1

Author: Allaway Robert J
Baker Aaron
Banerjee Jineta
Blakeley Jaishri O
Gosline Sara Jc
Greene Casey S
Guinney Justin
Hirbe Angela
Moon Chang In
Pratilas Christine A
Taroni Jaclyn N
Zhang Xiaochun
Publication venue: Digital Commons@Becker
Publication date: 01/01/2020
Field of study

Neurofibromatosis type 1 (NF1) is a monogenic syndrome that gives rise to numerous symptoms including cognitive impairment, skeletal abnormalities, and growth of benign nerve sheath tumors. Nearly all NF1 patients develop cutaneous neurofibromas (cNFs), which occur on the skin surface, whereas 40-60% of patients develop plexiform neurofibromas (pNFs), which are deeply embedded in the peripheral nerves. Patients with pNFs have a ~10% lifetime chance of these tumors becoming malignant peripheral nerve sheath tumors (MPNSTs). These tumors have a severe prognosis and few treatment options other than surgery. Given the lack of therapeutic options available to patients with these tumors, identification of druggable pathways or other key molecular features could aid ongoing therapeutic discovery studies. In this work, we used statistical and machine learning methods to analyze 77 NF1 tumors with genomic data to characterize key signaling pathways that distinguish these tumors and identify candidates for drug development. We identified subsets of latent gene expression variables that may be important in the identification and etiology of cNFs, pNFs, other neurofibromas, and MPNSTs. Furthermore, we characterized the association between these latent variables and genetic variants, immune deconvolution predictions, and protein activity predictions

Digital Commons@Becker

A Multifaceted Benchmarking of Synthetic Electronic Health Record Generation Models

Author: Guinney Justin
Malin Bradley A.
Mooney Sean D.
Omberg Larsson
Wan Zhiyu
Yan Chao
Yan Yao
Zhang Ziqi
Publication venue
Publication date: 01/08/2022
Field of study

Synthetic health data have the potential to mitigate privacy concerns when sharing data to support biomedical research and the development of innovative healthcare applications. Modern approaches for data generation based on machine learning, generative adversarial networks (GAN) methods in particular, continue to evolve and demonstrate remarkable potential. Yet there is a lack of a systematic assessment framework to benchmark methods as they emerge and determine which methods are most appropriate for which use cases. In this work, we introduce a generalizable benchmarking framework to appraise key characteristics of synthetic health data with respect to utility and privacy metrics. We apply the framework to evaluate synthetic data generation methods for electronic health records (EHRs) data from two large academic medical centers with respect to several use cases. The results illustrate that there is a utility-privacy tradeoff for sharing synthetic EHR data. The results further indicate that no method is unequivocally the best on all criteria in each use case, which makes it evident why synthetic data generation methods need to be assessed in context

arXiv.org e-Print Archive

KRAS mutation and Consensus Molecular Subtypes 2 and 3 are independently associated with reduced immune infiltration and reactivity in colorectal cancer

Author: Beggs Andrew D
Goussous Ghaleb
Guinney Justin
Lal Neeraj
Mason Michael
Middleton Gary
Pickles Oliver J
Taniere Philippe
White Brian S
Willcox Benjamin E
Publication venue
Publication date: 23/10/2017
Field of study

University of Birmingham Research Portal

CRI iAtlas: an interactive portal for immuno-oncology research.

Author: Chae Yooree
Chung Verena
Dang Kristen
Eddy James A
Gibbs David L
Guinney Justin
Heimann Carolina
Lamb Andrew E
Shmulevich Ilya
Thorsson Vésteinn
Vincent Benjamin G
Yu Jia Xin
Publication venue: Providence St. Joseph Health Digital Commons
Publication date: 01/01/2020
Field of study

The Cancer Research Institute (CRI) iAtlas is an interactive web platform for data exploration and discovery in the context of tumors and their interactions with the immune microenvironment. iAtlas allows researchers to study immune response characterizations and patterns for individual tumor types, tumor subtypes, and immune subtypes. iAtlas supports computation and visualization of correlations and statistics among features related to the tumor microenvironment, cell composition, immune expression signatures, tumor mutation burden, cancer driver mutations, adaptive cell clonality, patient survival, expression of key immunomodulators, and tumor infiltrating lymphocyte (TIL) spatial maps. iAtlas was launched to accompany the release of the TCGA PanCancer Atlas and has since been expanded to include new capabilities such as (1) user-defined loading of sample cohorts, (2) a tool for classifying expression data into immune subtypes, and (3) integration of TIL mapping from digital pathology images. We expect that the CRI iAtlas will accelerate discovery and improve patient outcomes by providing researchers access to standardized immunogenomics data to better understand the tumor immune microenvironment and its impact on patient responses to immunotherapy

Providence St. Joseph Health Digital Commons

Colorectal Cancer Consensus Molecular Subtypes Translated to Preclinical Models Uncover Potentially Targetable Cancer Cell Dependencies

Author: Arjama Mariliina
Bruun Jarle
Danielsen Stine A.
Dienstmann Rodrigo
Eide Peter W.
Eilertsen Ina A.
Elez Elena
Guinney Justin
Kallioniemi Olli
Kryeziu Kushtrim
Lothe Ragnhild A.
Murumägi Astrid
Nesbakken Arild
Palmer Hector G.
Ramirez Lorena
Sveen Anita
Tabernero Josep
Publication venue
Publication date: 15/02/2018
Field of study

Purpose: Response to standard oncologic treatment is limited in colorectal cancer. The gene expression-based consensus molecular subtypes (CMS) provide a new paradigm for stratified treatment and drug repurposing; however, drug discovery is currently limited by the lack of translation of CMS to preclinical models. Experimental Design: We analyzed CMS in primary colorectal cancers, cell lines, and patient-derived xenografts (PDX). For classification of preclinical models, we developed an optimized classifier enriched for cancer cell-intrinsic gene expression signals, and performed high-throughput in vitro drug screening (n = 459 drugs) to analyze subtype-specific drug sensitivities. Results: The distinct molecular and clinicopathologic characteristics of each CMS group were validated in a single-hospital series of 409 primary colorectal cancers. The new, cancer cell-adapted classifier was found to perform well in primary tumors, and applied to a panel of 148 cell lines and 32 PDXs, these colorectal cancer models were shown to recapitulate the biology of the CMS groups. Drug screening of 33 cell lines demonstrated subtype-dependent response profiles, confirming strong response to EGFR and HER2 inhibitors in the CMS2 epithelial/canonical group, and revealing strong sensitivity to HSP90 inhibitors in cells with the CMS1 microsatellite instability/immune and CMS4 mesenchymal phenotypes. This association was validated in vitro in additional CMS-predicted cell lines. Combination treatment with 5-fluorouracil and luminespib showed potential to alleviate chemoresistance in a CMS4 PDX model, an effect not seen in a chemosensitive CMS2 PDX model. Conclusions: We provide translation of CMS classification to preclinical models and uncover a potential for targeted treatment repurposing in the chemoresistant CMS4 group. (C) 2017 AACR.Peer reviewe

Helsingin yliopiston digitaalinen arkisto

Recommended from our members

Improving Breast Cancer Survival Analysis through Competition-Based Multidimensional Modeling

Author: Alvarez Mariano Javier
Aparicio Samuel
Bilal Erhan
Børresen-Dale Anne-Lise
Caldas Carlos
Califano Andrea
Curtis Christina
Dutkowski Janusz
Friend Stephen H.
Guinney Justin
Ideker Trey
Jang In Sock
Kristensen Vessela N.
Logsdon Benjamin A.
Margolin Adam A.
Mecham Brigham H.
Pandey Gaurav
Rueda Oscar M.
Sauerwine Benjamin A.
Schadt Eric E.
Shimoni Yishai
Stolovitzky Gustavo A.
Tost Jorg
Vollan Hans Kristian Moen
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2013
Field of study

Breast cancer is the most common malignancy in women and is responsible for hundreds of thousands of deaths annually. As with most cancers, it is a heterogeneous disease and different breast cancer subtypes are treated differently. Understanding the difference in prognosis for breast cancer based on its molecular and phenotypic features is one avenue for improving treatment by matching the proper treatment with molecular subtypes of the disease. In this work, we employed a competition-based approach to modeling breast cancer prognosis using large datasets containing genomic and clinical information and an online real-time leaderboard program used to speed feedback to the modeling team and to encourage each modeler to work towards achieving a higher ranked submission. We find that machine learning methods combined with molecular features selected based on expert prior knowledge can improve survival predictions compared to current best-in-class methodologies and that ensemble models trained across multiple user submissions systematically outperform individual models within the ensemble. We also find that model scores are highly consistent across multiple independent evaluations. This study serves as the pilot phase of a much larger competition open to the whole research community, with the goal of understanding general strategies for model optimization using clinical and molecular profiling data and providing an objective, transparent system for assessing prognostic models

Columbia University Academic Commons

Directory of Open Access Journals

PubMed Central

A continuously benchmarked and crowdsourced challenge for rapid development and evaluation of models to predict COVID-19 diagnosis and hospitalization

Author: Aydin Zafer
Bergquist Timothy
Brugere Ivan
Bryson Kevin
Causey Jason
Chen Guanhua
for the DREAM Challenge Consortium
Gao Jifan
Guinney Justin
Jabeer Amhar
Jarvik Jeffrey G.
Lee Christoph I.
Long Dustin R.
Mooney Sean
Prosser Justin
Schaffter Thomas
Wilcox Adam
Yan Yao
Yao Yuxin
Yu Thomas
Publication venue: 'American Medical Association (AMA)'
Publication date: 01/01/2021
Field of study

Importance: Machine learning could be used to predict the likelihood of diagnosis and severity of illness. Lack of COVID-19 patient data has hindered the data science community in developing models to aid in the response to the pandemic. Objectives: To describe the rapid development and evaluation of clinical algorithms to predict COVID-19 diagnosis and hospitalization using patient data by citizen scientists, provide an unbiased assessment of model performance, and benchmark model performance on subgroups. Design, Setting, and Participants: This diagnostic and prognostic study operated a continuous, crowdsourced challenge using a model-to-data approach to securely enable the use of regularly updated COVID-19 patient data from the University of Washington by participants from May 6 to December 23, 2020. A postchallenge analysis was conducted from December 24, 2020, to April 7, 2021, to assess the generalizability of models on the cumulative data set as well as subgroups stratified by age, sex, race, and time of COVID-19 test. By December 23, 2020, this challenge engaged 482 participants from 90 teams and 7 countries. Main Outcomes and Measures: Machine learning algorithms used patient data and output a score that represented the probability of patients receiving a positive COVID-19 test result or being hospitalized within 21 days after receiving a positive COVID-19 test result. Algorithms were evaluated using area under the receiver operating characteristic curve (AUROC) and area under the precision recall curve (AUPRC) scores. Ensemble models aggregating models from the top challenge teams were developed and evaluated. Results: In the analysis using the cumulative data set, the best performance for COVID-19 diagnosis prediction was an AUROC of 0.776 (95% CI, 0.775-0.777) and an AUPRC of 0.297, and for hospitalization prediction, an AUROC of 0.796 (95% CI, 0.794-0.798) and an AUPRC of 0.188. Analysis on top models submitting to the challenge showed consistently better model performance on the female group than the male group. Among all age groups, the best performance was obtained for the 25- to 49-year age group, and the worst performance was obtained for the group aged 17 years or younger. Conclusions and Relevance: In this diagnostic and prognostic study, models submitted by citizen scientists achieved high performance for the prediction of COVID-19 testing and hospitalization outcomes. Evaluation of challenge models on demographic subgroups and prospective data revealed performance discrepancies, providing insights into the potential bias and limitations in the models

PubMed Central

Enlighten

A community challenge for a pancancer drug mechanism of action inference from perturbational profile data

The Columbia Cancer Target Discovery and Development (CTD2) Center is developing PANACEA, a resource comprising dose-responses and RNA sequencing (RNA-seq) profiles of 25 cell lines perturbed with similar to 400 clinical oncology drugs, to study a tumor-specific drug mechanism of action. Here, this resource serves as the basis for a DREAM Challenge assessing the accuracy and sensitivity of computational algorithms for de novo drug polypharmacology predictions. Dose-response and perturbational profiles for 32 kinase inhibitors are provided to 21 teams who are blind to the identity of the compounds. The teams are asked to predict high-affinity binding targets of each compound among similar to 1,300 targets cataloged in DrugBank. The best performing methods leverage gene expression profile similarity analysis as well as deep-learning methodologies trained on individual datasets. This study lays the foundation for future integrative analyses of pharmacogenomic data, reconciliation of polypharmacology effects in different tumor contexts, and insights into network-based assessments of drug mechanisms of action.Peer reviewe

PubMed Central

Helsingin yliopiston digitaalinen arkisto